Hybrid Algorithm, to segment Character in Gurmukhi Handwritten Text, with a Comparative Study
Abstract
Optical character recognition (OCR) has been researched extensively over the last few decades, driven by the desire to turn scanned text documents into editable ones. OCR is the process of recognizing a segmented part of a scanned image as a character. It consists of three major sub-processes: preprocessing, segmentation (the most crucial step), and recognition. Incorrect segmentation cannot lead to correct recognition; it is a case of garbage in, garbage out. Handwritten documents are harder still, because they offer only a few points at which a segmentation cut can be made. In this paper, we formulate an algorithm that segments a scanned document image into characters. The algorithm extracts one part from a word in a text line and checks whether that part contains a meaningful symbol of the Gurmukhi script. If it does, the extracted part is marked and written to a file; otherwise, the extracted part is readjusted to find the symbol. We implemented this approach and compared its results with those of other algorithms; the results were reasonably good.
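The extract, check, and readjust loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how a part is extracted or readjusted, so the sketch assumes that candidate cuts are ink-free columns of a binary word image and that readjusting means extending the part to the next candidate cut. The names `candidate_cuts`, `segment_word`, and the `is_symbol` predicate (standing in for the Gurmukhi symbol check) are hypothetical.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]  # binary word image: 1 = ink, 0 = background


def column_has_ink(word: Image, x: int) -> bool:
    """True if any row of the word image has ink in column x."""
    return any(row[x] for row in word)


def candidate_cuts(word: Image) -> List[int]:
    """Assumed cut candidates: every ink-free column, plus the right edge."""
    width = len(word[0])
    cuts = [x for x in range(width) if not column_has_ink(word, x)]
    cuts.append(width)
    return cuts


def crop(word: Image, left: int, right: int) -> Image:
    """Return the columns [left, right) of the word image."""
    return [row[left:right] for row in word]


def segment_word(
    word: Image, is_symbol: Callable[[Image], bool]
) -> List[Tuple[int, int]]:
    """Extract a part, check it, and widen it when the check fails.

    `is_symbol` is a hypothetical stand-in for the test that a part
    holds a meaningful Gurmukhi symbol. Returns (left, right) column
    spans of the accepted parts.
    """
    segments: List[Tuple[int, int]] = []
    cuts = candidate_cuts(word)
    left = 0
    for i, right in enumerate(cuts):
        # Skip blank columns at the start of the current part.
        while left < right and not column_has_ink(word, left):
            left += 1
        if left >= right:
            continue
        is_last = i == len(cuts) - 1
        # Assumption: leftover ink at the end of the word is emitted as-is.
        if is_symbol(crop(word, left, right)) or is_last:
            segments.append((left, right))  # mark the accepted part
            left = right
        # Otherwise keep `left` fixed, so the next iteration
        # readjusts the part by extending it to the next cut.
    return segments


if __name__ == "__main__":
    # Two symbols; the first is broken by a gap at column 1.
    word = [
        [1, 0, 1, 0, 1, 1],
        [1, 0, 1, 0, 1, 1],
    ]

    # Toy check: a part counts as a symbol if it has two or more ink columns.
    def toy_check(part: Image) -> bool:
        return sum(column_has_ink(part, x) for x in range(len(part[0]))) >= 2

    print(segment_word(word, toy_check))  # [(0, 3), (4, 6)]
```

In the toy run, the first extracted part (column 0 alone) fails the check, so it is widened across the gap to the next cut before being accepted. A real checker would need a recognizer or structural rules for Gurmukhi symbols, which the abstract does not specify.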
Similar resources
Segmentation of Broken Characters of Handwritten Gurmukhi Script
Character segmentation of handwritten documents has been an active area of research and, owing to its diverse application environments, continues to be a challenging research topic. The desire to edit scanned text documents forces researchers to think about optical character recognition (OCR). OCR is the process of recognizing a segmented part of the scanned image as a character. OCR proce...
Feature Extraction and Classification Techniques in O.C.R. Systems for Handwritten Gurmukhi Script – A Survey
Optical character recognition (OCR) has been a very popular research field since the 1950s. A great deal of work has been done for various scripts, particularly English, but research on Indian scripts is limited. This paper presents an overview of the various OCR systems developed for handwritten isolated Gurmukhi text. In case of printed Gurmukhi text a lot of rese...
A Study of Touching Characters in Degraded Gurmukhi Text
Character segmentation is an important preprocessing step for text recognition. In degraded documents, the existence of touching characters drastically decreases the recognition rate of any optical character recognition (OCR) system. In this paper, a study of touching Gurmukhi characters is carried out, and these characters have been divided into various categories after careful analysis. Structural ...
Segmentation Problems and Solutions in Printed Degraded Gurmukhi Script
Character segmentation is an important preprocessing step for text recognition. In degraded documents, the existence of touching characters drastically decreases the recognition rate of any optical character recognition (OCR) system. In this paper, we propose a complete solution for segmenting touching characters in all three zones of printed Gurmukhi script. A study of touching Gurmukhi cha...
Offline Handwritten Gurmukhi Character Recognition: A Review
All over India, more than 12 crore people use the Gurmukhi script for speaking, documenting, and other purposes. Considerable advancement in work on the recognition of handwritten and printed Gurmukhi text has been reported in the last few years. Over the last few decades, offline handwritten character recognition has attracted a lot of interest from researchers. It is well known that eac...
Publication date: 2011